How Trustworthy Is Your Tree? Bayesian Phylogenetic Effective Sample Size Through the Lens of Monte Carlo Error

نویسندگان

چکیده

Bayesian inference is a popular and widely-used approach to infer phylogenies (evolutionary trees). However, despite decades of widespread application, it remains difficult judge how well given Markov chain Monte Carlo (MCMC) run explores the space phylogenetic trees. In this paper, we investigate error phylogenies, focusing on high-dimensional summaries posterior distribution, including variability in estimated edge/branch (known phylogenetics as “split”) probabilities tree probabilities, summary tree. Specifically, ask if there any measure effective sample size (ESS) applicable trees which capable capturing these three measures. We find that are some ESS measures inherent using MCMC samples approximate distributions phylogenies. term measures, identify set useful practice for assessing error. Lastly, present visualization tools can improve comparisons between multiple independent runs by accounting each chain. Our results indicate common post-MCMC workflows insufficient capture tree, highlight need both within-chain mixing between-chain convergence assessments.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Estimating the Effective Sample Size of Tree Topologies from Bayesian Phylogenetic Analyses

Bayesian phylogenetic analyses estimate posterior distributions of phylogenetic tree topologies and other parameters using Markov chain Monte Carlo (MCMC) methods. Before making inferences from these distributions, it is important to assess their adequacy. To this end, the effective sample size (ESS) estimates how many truly independent samples of a given parameter the output of the MCMC repres...

متن کامل

Phylogenetic effective sample size.

In this paper I address the question-how large is a phylogenetic sample? I propose a definition of a phylogenetic effective sample size for Brownian motion and Ornstein-Uhlenbeck processes-the regression effective sample size. I discuss how mutual information can be used to define an effective sample size in the non-normal process case and compare these two definitions to an already present con...

متن کامل

Bayesian Inference in Monte-Carlo Tree Search

Monte-Carlo Tree Search (MCTS) methods are drawing great interest after yielding breakthrough results in computer Go. This paper proposes a Bayesian approach to MCTS that is inspired by distributionfree approaches such as UCT [13], yet significantly differs in important respects. The Bayesian framework allows potentially much more accurate (Bayes-optimal) estimation of node values and node unce...

متن کامل

Determining the optimal sample size in the Monte Carlo experiments

A convergence criterion for the Monte Carlo estimates will be proposed which can be used as a stopping rule for the Monte Carlo experiments. The proposed criterion searches a convergence band of a given width and length such that the probability of the Monte Carlo sample variance to fall outside of this band is practically null. After the convergence to the process variance realized according t...

متن کامل

Phylogenetic tree construction using sequential stochastic approximation Monte Carlo

Monte Carlo methods have received much attention recently in the literature of phylogenetic tree construction. However, they often suffer from two difficulties, the curse of dimensionality and the local-trap problem. The former one is due to that the number of possible phylogenetic trees increases at a super-exponential rate as the number of taxa increases. The latter one is due to that the phy...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Bayesian Analysis

سال: 2023

ISSN: ['1936-0975', '1931-6690']

DOI: https://doi.org/10.1214/22-ba1339